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Abstract 

We define a family of natural decompositions of Sturmian words in Christoffel 
words, called reversible Christoffel (RC) factorizations. They arise from the ob- 
servation that two Sturmian words with the same language have (almost always) 
^ ■ arbitrarily long Abelian equivalent prefixes. Using the three gap theorem, we prove 

that in each RC factorization, only 2 or 3 distinct Christoffel words may occur. 
We begin the study of such factorizations, considered as infinite words over 2 or 
^ ■ 3 letters, and show that in the general case they are either Sturmian words, or 

0^ . obtained by a three-interval exchange transformation. 

■ Keywords: Sturmian words; Christoffel words; RC factorizations; Three-interval 
. exchange transformation. 
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2: 1 Introduction 
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In combinatorics on words and symbolic dynamics, it is often meaningful to look at two 
infinite words w, w' and determine the segments where they coincide, that is, locate 
! maximal occurrences of factors u such that w = pus, w' = p'us' for some words p and 

p' of equal length and some infinite words s, s'. 

If w and w' are two fixed points of an irreducible Pisot substitution (f, the strong 
coincidence conjecture (proved by Barge and Diamond [sl in the binary case) states that 
there exists a letter a and two factorizations 

w = pas, w' = pas' (1) 

such that p and p' are Abelian equivalent, i.e., an "anagram" of each other. This has two 
remarkable consequences: 
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1. w and w' agree on arbitrarily long segments (as defined above), 



2. w and w' have arbitrarily long Abelian equivalent prefixes. 

Indeed, from ([T]) it follows w = y9"(p)(y9"(a)v9"(s) and w' = (y9"(p')(y9"(a)v9"(s') for all 
n > 0. 

In general, two words satisfying condition [1] above are said to be proximal; we say that 
two infinite words w,w' are Abelian comparable if condition [2] is satisfied. This induces 
two factorizations (comparison) 



w = XiX2 ■■■Xn--- , , 

W — X^X2 ■ ■ - X^ - ■ ■ 



defined so that each pair of Abelian equivalent prefixes of w and w' can be written as 
) for some A; > 0; equivalently, for all > 0, and x'^. are the shortest 
nonempty Abelian equivalent prefixes of the infinite words XkXk+i ■ ■ ■ and a^'^a;'^^^ ■ ■ ■ . 

In this paper (a preliminary version of which was presented at the first RuFiDiM f9|), 
we look at Sturmian words over A = {0, 1} from a similar point of view. We recall 
that an infinite word over A is Sturmian if it has exactly n + 1 distinct factors of each 
length n > 0. The first systematic study of Sturmian sequences is usually credited to 



Morse and Hedlund jl3|, from the point of view of symbolic dynamics. In fact, every 
Sturmian word can be realized either as a lower mechanical word or as an upper one. 
The lower (resp. upper) mechanical word s^^p (resp. s^^ ^) of slope a and intercept p, with 
< a, p < 1, is the infinite word indexed over N whose ?7,-th letter is if 

{na + p} < 1 — a (resp. if < {na + p} < 1 — a) 

and 1 otherwise (denoting by {a} the fractional part a — [a\ of the real number a). Thus 
mechanical words encode rotations by angle 2Tia on a circle, and a gives the frequency 
of the letter 1 in the infinite word. Moreover, the slope determines the language (set 
of factors). Among binary words, mechanical words are characterized by the balance 
property: the number of occurrences of the letter 1 in two factors of the same length 
may differ at most by 1. 

When a ^ Q, the word So,a = =: c is said to be the characteristic (or standard) 
Sturmian word of slope a. Its prefixes are exactly all left special factors of the Sturmian 
words of slope a, i.e., p G Pref (c) if and only if Op, Ip G Fact(sQ, p) for any p. We say that 
a Sturmian word is singular if it contains the characteristic word (of the same slope) as 
a proper suffix. By definition, we have 

Safl = Oc and ^ = Ic ; 

any other singular Sturmian word can be written as pOlc or plOc, where p G Pref(c). It 
is easy to see that every nonsingular Sturmian word is both an upper mechanical word 
and a lower one. 
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Mechanical words of irrational slope are exactly all Sturmian words, whereas those 
of rational slope are periodic words; when a G Q, the shortest v such that Sq, q = 
(resp. s'ao = v^) is the lower (resp. upper) Christoffel word of slope a. It is well known 
(cf. 0) that the set of lower (resp. upper) Christoffel words can be characterized as 
A U OPl (resp. A U IPO), where V is the set of central words, i.e., words u such that 
both OmI and ImO are factor of some Sturmian word s. Central words are exactly all 
palindromic prefixes of characteristic Sturmian words. The following characterization of 
central words is well known. 



Proposition 1.1 (de Luca [ll|). A word u & A* is central if and only if it is a palin- 
drome satisfying one of the following conditions: 

1. u is a power of a letter, z.e., m G 0* U 1*, or 

2. u = pOlg = qlQp for some palindromes p,q & A* . 

In the latter case, the words p, q are central too, and uniquely determined; one of them 
is the longest palindromic prefix (and suffix) of u. 

By iterated application of the previous result, one easily obtains: 

Corollary 1.2. Let v be a proper palindromic prefix of a central word u, such that 
f ^ 0* U 1*. Then either fOl or f 10 is a prefix of u. 

Since central words are palindromes, we have that v = Oul is a lower Christoffel 
word if and only if its reversal v = luO is an upper Christoffel word. We recall that 
any nontrivial (i.e., longer than a letter) Christoffel word can be uniquely written as a 
product of two (shorter) Christoffel words. All pairs {u, v) of lower Christoffel words 
such that uv is Christoffel make up the Christoffel tree (cf. Berstel and de Luca [4]); the 
pair (0,1) is its root^, and every node (u^v) has the two children {u,uv) and (uv,v). 
Moreover, all lower Christoffel factors of an infinite Sturmian word are found on an 
infinite path on the tree. Thus, in particular, the following properties hold: 

Proposition 1.3. Let u,v,z be Christoffel words such that z = uv, and let w be a 
Sturmian word with z G Fact(w). Then: 

1. u^v and uv"^ are Christoffel words too, and exactly one of them is a factor ofw; 

2. if {u,v} 7^ {0, 1}, then either u is a prefix of v, or v is a suffix of u. 

For more information on Sturmian and Christoffel words, we refer the reader to [1, @|. 

In the next sections, we shall deal with the comparison of Sturmian words. We 
begin by proving (Proposition 12. ip that all pairs of Sturmian words of the same slope, 
except one, are Abelian comparable. Our main result (Theorem 13. ip shows that for 
Abelian comparable Sturmian words w and w', the factorizations in ^ have at most 
3 distinct terms, i.e., the sets {xi}i>o and {a;^}j>o have cardinality 2 or 3; furthermore, 
all such terms are Christoffel words. Finally, we shall examine the structure of these 
factorizations. 



^Using (1,0) instead, all upper Christoffel pairs are obtained. 



3 



2 Comparison of Sturmian words: RC factorizations 



Trivially, if two Sturmian words are Abelian comparable then they have the same slope 
(and hence the same language). The following proposition shows that the converse holds 
too, with a single exception. 

Proposition 2.1. Let w, w' be Sturmian words of slope a, and c = Sa,a be the charac- 
teristic word. //{w,w'} 7^ {Oc, Ic}, then w and w' are Abelian comparable. 

Proof. Let Xi and x[ be the shortest nonempty prefixes (of w and w' respectively) which 
are Abelian equivalent. These are well defined; in fact, suppose by contradiction that w 
and w' have no Abelian equivalent prefixes except e. By the balance property, it follows 
{w, w'} = {Ot, It} for some infinite word t; as all prefixes of t are left special, we get 
t = c, contradicting our hypothesis. 

Let then w = XiWi and w' = x'iw[ for some Sturmian words Wi, w'^^ having the same 
language as w and w'. We have {wi,w']^} 7^ {Oc, Ic}, for otherwise we would obtain 
{w, w'} = {plOc,p01c} for some p G Pref(c), and then {xi,x[} = {pl,pO}, which is 
absurd as Xi and x[ are Abelian equivalent. Hence we can iterate this argument to get 
infinitely many Abelian equivalent prefixes of w and w'. □ 

Example 2.2. Let a = (3 — -\/5)/2. The Fibonacci word 

f = s„,„ = 010010100100101001010010010100100101001010010010100101001001 ■ ■ ■ 

is the most famous Sturmian word. The Abelian comparison of the words f and f : = 
Sa,4/5 is: 
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Let w, w' be two Sturmian words having the same language, and suppose {w, w'} 7^ 
{Oc, Ic} where c is the characteristic word with the same slope. By Proposition 12. H w 
and w' are then Abelian comparable; let their comparison be given by ([2]). 

By definition, Xi and are Abelian equivalent for alH > 1. By the balance property, 
it follows either Xi = x[ & A, or {xi,x[} = {Owl, 1^0} for some factor u of w, which is 
then a central word. We conclude that in all and x'j^ are Christoffel words, with 

x- = aTj. Thus we can write: 

W = Xl Xo ' ' ' Xrt ' * * , , , 

, IZ Z (3) 

W = X\X2 ■■ - Xn-- ■ ■ 

Conversely, if (x„)„>o is a sequence of Christoffel words such that both infinite words 
in ([3]) are Sturmian, then the Abelian comparison of w and w' yields exactly the same 
factorizations. 

This motivates the following definition: we call reversible Christoffel (RC) factor- 
ization of a Sturmian word w any infinite sequence {xk)k>o of Christoffel words such 
that 
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1. w = X1X2 ■ ■ ■ x„ ■ ■ ■ , and 

2. w' := X1X2 ■ ■ -x^ - ■ ■ is a Sturmian word. 

We also say that {xk)k>Q is the RC factorization of w determined by w'. 

A trivial RC factorization is obtained by choosing all x^'s to be single letters, so that 
w' = w. The definition implies that every choice of w' determines a distinct factorization 
of w; this proves the following statement. 

Proposition 2.3. Every Sturmian word admits uncountahly many distinct RC factor- 
izations. 

The following lemma is an immediate consequence of the balance property: 

Proposition 2.4. Let w he a Sturmian word, and {xk)k>Q be any RC factorization of 
w. Then the terms Xk, k > 1, are either all upper Christoffel words, or they are all 
lower Christoffel words. 

In the remainder of this section, we shall use the above definitions and results to give 
a proof of a stronger (and probably known) version of the strong coincidence conjecture 
in the case of Sturmian words, namely Proposition 12.71 below. We need the following 
lemma (a restatement of fiol . Lemma 5.2]): 

Lemma 2.5 (Bucci, Puzynina, and Zamboni flO^). Two distinct Sturmian words w, w' 
are proximal if and only if they can he written as w = qc, w' = q'c for some q, q' with 
\q\ = \q'\; that is, if and only if they contain the characteristic word at the same position. 

As observed above, in such a case the set {q, q'} is either {0, 1} or {p01,plO} for some 
p G Pref(c). 

We recall that a morphism f : A* ^ A* is said to be Sturmian if it maps Sturmian 
words to Sturmian words; as is well known (cf. Q), for a morphism to be Sturmian it 
suffices to map one Sturmian word to another one. Furthermore, Sturmian morphisms 
form a monoid generated by the three substitutions 

E -.0^1,1^0, : 01,1 ^ 0, and : ^^ 10,1 ^ . (4) 

From this, an easy induction argument gives the following property, also well known: 

Lemma 2.6. If f ^ {id, E} is a Sturmian morphism, then one hetween /(O) and /(I) 
is a proper prefix or a proper suffix of the other. 

We also recall that a complete characterization of Sturmian fixed points of morphisms 
in terms of slope and intercept was given by Yasutomi [l3| (see also (7|). 

The following result about coincidence for Sturmian words can be easily proved as a di- 
rect consequence of well known characterizations and properties of Sturmian morphisms. 
We give here a proof based on RC factorizations: 
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Proposition 2.7. Let w, w' be distinct Sturmian words that are fixed points of a non- 
trivial morphism f : A* ^ A* . Then {w, w'} = {01c, 10c}, where c is the characteristic 
word having the same language. 

Proof. It is easy to see tliat w and w' have the same language. Without loss of generality, 
we can assume that w starts with and w' starts with 1. Let us first show that the 
combination w = Oc, w' = Ic is not possible. If /(Oc) = Oc and /(Ic) = Ic, it would 
follow c = A/(c) for some word A, and then |/(0)| = |/(1)|, since c is not ultimately 
periodic. This is impossible in view of Lemma 12.61 

Thus by Proposition 12. H w and w' are Abelian comparable; let w = X1X2 ■ ■ ■ and 
w' = X1X2 • ■ ■ be their corresponding RC factorizations. By contradiction, suppose they 
contain only words of length > 1. By Proposition 12. 4[ for all n > 1 we can write 
Xn = Om„1 for suitable words «„. Since f{xi) is Abelian equivalent to /(xi), there must 
be an m > 1 such that f{xi) = X1X2 ■ ■ ■ Xm- Hence Xi and its image both end with 1, so 
that the word /(I), which clearly starts with 1, ends in 1 as well. As f{xi) = Xi- ■ ■ x^, 
by the same argument it follows that /(O) begins and ends in 0. Again, this contradicts 
Lemma 12.61 

Therefore, there must be some term Xi of length 1 in the RC factorization, that is, ([T]) 
holds. As discussed above, this implies that w and w' are proximal. Since {w, w'} 7^ 
{Oc, Ic}, by Lemma [2.51 it follows {w, w'} = {p01c,plOc} for some p G Pref(c); as w 
starts with and w' with 1, the assertion is proved. □ 

Example 2.8. The words Olf and lOf are both fixed by (f"^, as defined in (jlj). 

3 Main results 

3.1 Terms of RC factorizations 

Our main theorem shows that RC factorizations have at most 3 distinct terms. 

Theorem 3.1. Let w be a Sturmian word, and w = X1X2 • • ■ x„ ■ ■ ■ be an RC factoriza- 
tion ofw. The cardinality of the set X = {x„ \ n > 0} is either 2 or 3, and in the latter 
case, the longest element of X is obtained concatenating the other two. 

The proof relies on the following well known result by Slater deeply related to 
the three distance theorem proved by Sos (isj (see also 1^1]): 

Theorem 3.2 (Three gap theorem). Let a be an irrational number with < a < 1, and 
let < P < 1/2. The gaps between the successive integers n such that {na} < /3 take 
either two or three values, one being the sum of the other two. 

Proof of Theorem IJ.il Since Sturmian words are not periodic, the set X has cardinality 
at least two. Let w' = X1X2 ■ ■ ■ , and suppose first that w and w' are both lower 
mechanical words, so that w = s^^p and w' = Sa^p' for some p,p' G [0, 1[. Without loss 
of generality, we may suppose a < 1/2 (otherwise it suffices to exchange the roles of 
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the letters and 1), and /3 := {p' — p} < 1/2 (otherwise we swap w and w'). Hence 
a < 1 — a and (3 < 1 — (3. 
We distinguish two cases: 

1. If a < /3, then w and w' cannot be 1 at the same time, i.e., there is no n for 
which {na + p} and {na + p'} are both larger than 1 — a. Assuming, in view of 
Proposition 12. 4^ that all terms Xk [k > 1) are lower Christoffel words (the "upper" 
case being similar), this implies X C {0} U {0*''l | k > 0}, since any other lower 
Christoffel word x would have a 1 in the same position as in x. Let ki, and k2 
be positive integers such that i < j , Xi = O'^^l, and Xj = O^^l. This implies, since 
w and w' have the same language, that w has the two factors 

XiXi+i ■ ■ ■ Xj^iXj = O'^Ua^j+i ■ ■ ■ a;j_iO'^^l , and 

■ ■ ■ Xj^i 

so that |/ci — < 1 as a consequence of the balance property. By the arbitrary 
choice of i and j, it follows X C {0, 0^1, O'^+H} for some h > 0, which settles this 
case. 

2. If a > /3, w and w' differ exactly in all positions n such that {na + p} G /i U I2, 
with Ii = [1 — a — (3,1 — a[ and /2 = [1 — (3, 1[. Note that if {na + p} G /i, then 
{{n + l)a + p} G h- Hence, ii p ^ h we derive X C {0, 1, 01}. 

If p G I21 let (nj)j>o be the increasing sequence of all positive integers such that 
{riia + p} G /2. For alH > we have {riia + p + /3}</3, so that by Theorem 13.21 
it follows that the set {rij+i — nj | i > 0} is contained in {ki, k2, ki + k2} for some 
distinct integers /ci,/c2 (both greater than 1, as a < 1 — For all i > 0, Xi is 
the factor of w starting with the n^-th letter and ending in the (nj+i — l)-th one, 
since it can be written as ImO for some u such that the corresponding factor of w' 
is OmI. Thus, the elements of X may have length ki, k2, or ki + k2. There cannot 
be two of the same length, since any factor of w which is an upper Christoffel 
word of length > 2 can be written as luO where m is a palindromic prefix of the 
characteristic word of slope a. Hence X has cardinality 2 or 3. 

Let now X = {yi,y2, z} with \z\ = ki + k2 and \yi\ = ki for i = 1,2; we can write 
z = IvO and yi = lUiO. Since ui, U2, and v are all palindromic prefixes of the 
characteristic word, Ui is a prefix and a suffix of v for i = 1,2. Hence 

V = Uiahu2 = U2haui (5) 

for some letters a,b & {0, 1}. If a 7^ 6, it follows either z = yiy2 01 z = y2yi and 
we are done. 

By contradiction, let us then suppose a = b. From a < 1/2 we derive that 
a = b = 0, and that both Ui and U2 start with the letter 0. If both were a power 
of we would reach a contradiction, as no point of the (dense) sequence {na + p} 
would lie in the nonempty interval [1 — a,l — P[. Hence there exists j G {1,2} 
such that Uj contains both and 1, so that by Corollarv 11.21 either UjOl or ujlO is 
a prefix of v. This is a contradiction, since we are assuming (|5]) with a = b = 0. 
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When w and w' are both upper mechanical words, the proof is symmetricaL 

Let us now suppose that both w and w' are singular, and that only one of them is 
a lower mechanical word. By contradiction, suppose that X has more than 3 elements. 
This means that there exists j > 3 such that the set {xi, 0:2, . . . , Xj} C X has cardinality 
4. The word r = Xi - ■ -Xj is prefix of infinitely many nonsingular Sturmian words of 
slope a, and so is r' := xi ■ ■ ■ Xj . Let t, t' be two such nonsingular extensions, of r and r' 
respectively. The RC factorization of t and t' has more than 3 distinct terms, but t and 
t' are both lower mechanical words, a contradiction because of what we proved above. 
The same argument by contradiction proves that if X has cardinality 3, then its longest 
element has to be a concatenation of the other two. □ 

3.2 RC factorizations as Sturmian or 3-iet words 

Theorem 13.11 allows to consider RC factorizations as infinite words on the finite alphabet 
X. In this section we analyze the structure of such words. 

We recall that two finite words m, v are conjugate if m = A/i and v = fiX for some 
words A, fi. The following characterization of Sturmian morphisms was proved in [8, 
Theorem A.l]: 

Theorem 3.3 (Berthe, de Luca, and Reutenauer [sl). A morphism f : A* ^ A* is 
Sturmian if and only if it maps the three Christoffel words 01, 001, and Oil to conjugates 
of Christoffel words. 

Corollary 3.4. Ifu, v, and uv are Christoffel words, then the morphisms 1— )■ m, 1 1— )■ f 
and f , 1 (— )■ M are Sturmian. 

Proof. Immediate consequence of Proposition 11.31 and the previous theorem. □ 

Let us recall a further result (0, Proposition 2.3.2]) on the Sturmian morphisms ip 
and from 

Proposition 3.5 (Berstel and Seebold (HI). Let w he an infinite word. 

1. If (f(w) is Sturmian, then so is w. 

2. If ip{w) is Sturmian an w starts with 0, then w is Sturmian. 

Given a set X C A* and a word w G X*, by an abuse of language we say that 
a factorization w = X1X2 ■ ■ ■ , with Xj G X for all i, is a word over the alphabet X, 
identifying it with the word X1X2 ■ ■ ■ in the free monoid over X (or with the word 
f~^{xi)f~^{x2)---, where / is a bijection from a new alphabet B to X). The same 
identification is made also for factorizations of infinite words. 

A complete return to f G A* is a finite word containing exactly two occurrences of 
V, one as a prefix and one as a suffix. If u; is a finite or infinite word and f is a factor 
of w, then a (right) return word to v in w is a word r such that rv G Fact(ti') is a 
complete return to v. Left returns can be defined similarly, i.e., replacing rv with vr in 
the definition. 
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The following result is a known consequence of a theorem by Vuillon |l6l | characterizing 
Sturmian words as the ones having exactly two return words for each factor: 

Proposition 3.6 (cf. [12]). Let p be a prefix of a Sturmian word w. The sequence of 
return words to p in w is Sturmian; that is: 

1. p has exactly two distinct return words in w, say u and v, and 

2. given the morphism fp : u,l ^ v, the word Wp G A'^ such that fp{wp) = w is 
Sturmian. 

We can now begin to shed light on the structure of RC factorizations. 

Proposition 3.7. Let w = xiX2 ■ ■ ■ and X be defined as in Theorem \3.1\ Suppose 
X = {u, V, z} with z = uv. Then: 

1. The factorization w = yiy2--- , obtained from the starting RC factorization by 
replacing each occurrence of z with u ■ v (that is, defined so that for all i with 
Xi = z, there exists j with yi ■ ■ -yj-i = Xi ■ ■ -Xi^i, yj = x, and yj+i = v) is also 
reversible Christoffel. 

2. The new factorization yiy2 ■ ■ ■ is also a Sturmian word on the alphabet {u^v}. 

Proof. The result is trivially verified when {u,v} = {0, 1}. Let us then suppose this is 
not the case; by Proposition 1 1 . 3 1 we deduce that either -u is a prefix of v, or f is a suffix of 
u. Distinguishing such two cases, we shall first prove our second claim, i.e., that the new 
factorization defines a Sturmian word w = f~^{yi)f^^{y2) ' where / : m, 1 i— t". 

• If u is a prefix of f , let n > be the greatest integer such that m" is a prefix of v. 
Clearly is a prefix of w; we shall prove that u and v are the return words to 

in w, thus showing that w is indeed Sturmian by Proposition 13.61 

It is easy to check that and vu^ are indeed factors of w. Since u is primitive, it 
is clearly a return word to u^. Let v = u^u' for some u' & A* . By Proposition lL3[ u' 
and uu' are Christoffel words, so that we have either {m, u'} = {a, b} or u' G Suff(M) 
(m cannot be a prefix of u' by the maximality of n) . Clearly is a prefix and a suffix 
of f = u"'u'u"'; we need to show that has no other occurrences. If {u, u'} = A, 
this is trivial. If u' is a suffix of u, then cannot have internal occurrences in 
u^u'u^ because u is unbordered. Therefore u and v are return words to in w, 
so that w is Sturmian. 

• If i; is a suffix of u, let m > be the greatest integer such that f"^ is a suffix of 
u. The same argument as above shows that v^u and ti^+i are complete returns 
to v"^. Thus, writing u = v'v"^ for some v' G A*, we get that v and v"^v' are the 
return words to v"^ in the word f ™w. The sequence of these return words is again 
determined by w, as v'^w = (y'(w) with (7 : v"^v', 1 v. Hence, to prove that 
w is Sturmian, by Proposition 13.61 we only need to show that v"^w is Sturmian. 
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By mirroring the argument used in the proof of Proposition 12. ![ we get that since 
{w, w'} 7^ {Oc, Ic}, there exist arbitrarily long Abelian equivalent words r, r' such 
that rw and r'w' are Sturmian. By Theorem 13. the set of terms in the RC 
factorization of rw determined by r'w' cannot be larger than X = {u, v, z}. Since 
u = v'v^ and z = t>'t>™'+^, any sufficiently long r will have v"^ as a suffix. We have 
thus proved that v"^w = g{w) is Sturmian, and so is w. 

Since w is Sturmian, and the morphism / : t— )■ tt, 6 i— )■ {; is Sturmian by Corollary 13. 4[ 
the word /(w) is Sturmian too, so that the new factorization is actually the RC factor- 
ization of w determined by /(w). This completes the proof. □ 

Remark 3.8. If u, v, and uv are the terms of an RC factorization w = X1X2 ■ ■ ■ , then 
by Proposition 11.31 exactly one among u'^v and uv"^ is a factor of w. As a consequence 
of Proposition 13. 7^ it is easy to see that if u'^v (resp. uv'^) is a factor of w, then each 
occurrence of v (resp. u) in the factorization is preceded by u (resp. followed by v), 
provided that Xi 7^ v. This gives rise to the following "converse" of Proposition 13.71 

Proposition 3.9. Letu, v, andw = X\X2 ■ ■ ■ be as above. Replacing in the factorization 
each occurrence of u ■ v with one of z = uv ( that is, defining i/n for n > so that for 
all i where Xi = u and Xj+i = v there exists j with Xi ■ ■ ■ = yi - ■ ■ yj-i and yj = z) 
produces a new RC factorization of w, which is also a Sturmian word (over {z,u} or 
{z,v}). 

Proof. Let us first assume u'^v G Fact(w). By the previous Remark, it is clear that 
the replacement yields a factorization of w where v does not appear. Hence, the new 
factorization can be seen as an infinite word on the alphabet {u,z}, i.e., the image of 
some word w G A"^ under the morphism h : ^ z,l u. To see that w is Sturmian, by 
Proposition 13.51 it suffices to observe that ip{w) = w, where w is given by Proposition 13.71 
and is the Fibonacci morphism as in (jl]). 

To show that this new factorization is actually RC, once again we just need to observe 
that it is obtained by Abelian comparison with the word h{w), where the morphism 
h : ^ z,l ^ u is Sturmian by Corollary 13.41 

Now suppose Mf^ G Fact(w) instead. Essentially the same argument as above applies; 
we consider the new factorization as the image of an infinite word w under the morphism 
/i : I— )■ z, 1 I— )■ f . We have {E o <^)(w) = w, and w starts with 0, so that w is Sturmian 
by Proposition 13. 5t the factorization is RC since it is obtained by Abelian comparison 
with h{w), where /i : i— )■ 5, 1 1— )■ ?) is Sturmian by Corollary [331 CH 

We recall (cf. 0) that a 3-iet word is an infinite word coding the orbit of a point 
p under a three-interval exchange transformation T. More precisely, given an interval 
/ = [0, £] C M containing p and subdivided in three intervals /„ = [0, a], h = [a, a + P], 
and Ic = [a + f3, i], we let T be the piecewise linear transformation of / exchanging the 
three subintervals according to the permutation (321), i.e., let T :/—)•/ be defined by 
^(0 = ^ + if ^ ^ Ix, where x G {a, b, c} and 

ta = i — a, tb = i — 2a — /3, and tc = —a — (3 . 
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The 3-iet word determined by a, /3, i, and p is then the infinite word indexed over N 
whose n-th letter is x G {a, b, c} if T^{p) G Ix- 

Let (T, a' : {a, 6, c}* — A* be morphisms defined by 

a{a) = = cT'(a), a{h) = 01, = 10, a{c) = 1 = (t'(c) . (6) 

The following result was proved in (2], Theorem A] : 

Theorem 3.10 (Arnoux, Berthe, Masakova, and Pelantova [2]). An infinite word u on 
the alphabet {a,b,c}, whose letters have positive frequencies, is an aperiodic 3-iet word 
if and only if a{u) and (t'(u) are Sturmian words. 

As a consequence, we get that RC factorizations are in general 3-iet words: 

Corollary 3.11. Let w he a Sturmian word, and w = X1X2 ■ ■ ■ be an RC factorization 
with X = {xn \ n > 0} = {u,v, z}, z = uv. If every word of X occurs more than once 
in the factorization, then X1X2 ■ ■ ■ is an aperiodic 3-iet word over the alphabet {u, z, v}. 

Proof. Let r : {a,b,c} — {u,v,z} be defined by r(a) = u, T{b) = z, and r(c) = v. We 
need to show that the infinite word w := t^^{xi)t~^{x2) ■ ■ ■ is a 3-iet word. Clearly, 
exchanging the roles of w and w' (and letting r(a) = u etc.) does not change w; 
therefore, without loss of generality, in the following we can assume by Proposition 12.41 
that X is made of lower Christoffel words. 

Let a be the slope of w and w', and let p and p' respectively be their intercepts. As 
is well known, any factor 7 of w corresponds to an interval on the unit circle, i.e., 7 
occurs at position n in w if and only if {na + p} G Let now 7 G X, and suppose 
first that I7I > 1. The relation 

{na p} e I^n {I^ - p' p mod 1) (7) 

identifies the positions n of all occurrences of 7 in w such that 7 occurs at the same 
position in w'. As I7I > 1, we have 7 = Ogl and 7 = IgO for some word q. We claim that 
all occurrences of 7 whose position satisfies ((Tj) appear in the RC factorization. Indeed, 
if one of such occurrences did not correspond to Xi for any i > 1, then the first of Oql 
(resp. 1 of IgO) would have to be the last letter of some Xj (resp. Xj), against the fact 
that all Xj's are lower Christoffel words by Proposition 12.41 

Now suppose 7 is a letter. Without loss of generality, we can assume a < 1/2. If 
7 = 1, then necessarily X = {0, 1, 01}, so that ([7]) again identifies exactly all occurrences 
of 7 in w that appear in the RC factorization. Let then 7 = 0, so that X (or its mirror 
image) is {0, 0"1, 0""^^1} for some n > 0. As a consequence of the balance condition, it 
is easy to see that a position n where occurs in both w and w' corresponds to Xi for 
some 2 > 1 if and only if it is followed in w by Xj+i = 0"1. Hence, such positions are 
exactly those that satisfy 

{na + p} G /o"+ii n (/oio" ~ P + P mod 1) . (8) 

In all cases and for all 7 G X, we have identified intervals corresponding to the 
occurrences of 7 in the RC factorization, namely the ones in ([7]) or ([8]). By hypothesis. 



11 



7 occurs at least twice in the factorization; by the irrationahty of a, such intervals 
must then have nonempty interior, so that the gaps between consecutive integers n 
satisfying ([7]) or (|H]) are bounded. Thus, every term in the RC factorization, and so 
every letter in w, occurs with positive frequency. 

By Theorem I3.10[ it remains to prove that the two words 0"(w), (t'(w) are Sturmian, 
where a and a' are the morphisms defined in ([6]). In fact, it is easy to check that (t(w) = 
w and (t'(w) = w', i.e., the words (j(w) and (j'(w) coincide with the Sturmian words 
obtained in the proof of Proposition 13. 7^ applied respectively to the RC factorizations 
w = X1X2 ■ ■ ■ and w' = X1X2 ■ ■ ■ ■ The result follows. □ 

Remark 3.12. The hypothesis that every word of X occurs more than once in the 
factorization is necessary. For instance, it is easy to see that for the RC factorization 
X1X2 ■ ■ - Xn - ■ ■ of w = OlOf determined by w' = lOOlf, one has X = {0, 01, 001}, but 
the term occurs exactly once, as X2- In fact, by Proposition 13. 7[ the word f = X3X4 ■ ■ ■ 
is Sturmian over the alphabet {01,001}. 

4 Future work 

We believe that much can still be said about reversible Christoffel factorizations; for 
example, it would be interesting to characterize the set of terms X in terms of the 
slope a and the difference between the intercepts, in particular to distinguish when the 
cardinality of X is 2 or 3. 

Acknowledgment 

The second author would like to thank the Department of Mathematics at the University 
of Turku for its support during his part-time employment there in 2011-2012 as a member 
of the FiDiPro unit. 

References 

[1] p. Alessandri and V. Berthe. Three distance theorems and combinatorics on words. En- 
seign. Math. (2), 44:103-132, 1998. 

[2] P. Arnoux, V. Berthe, Z. Masakova, and E. Pelantova. Sturm numbers and substitution 
invariance of 3iet words. Integers, 8:A14, 17, 2008. 

[3] M. Barge and B. Diamond. Coincidence for substitutions of Pisot type. Bull. Soc. Math. 
France, 130:619-626, 2002. 

[4] J. Berstel and A. de Luca. Sturmian words, Lyndon words and trees. Theoret. Comput. 
Sci., 178:171-203, 1997. 

[5] J. Berstel and P. Seebold. Sturmian words. In M. Lothaire, editor, Algebraic Combinatorics 
on Words. Cambridge University Press, Cambridge UK, 2002. Chapter 2. 



12 



[6] J. Berstel, A. Lauve, C. Reutenauer, and F. Saliola. Combinatorics on Words: Christoffel 
Words and Repetition in Words, volume 27 of CRM monograph series. American Mathe- 
matical Society, 2008. 

[7] V. Bertlie, H. Ei, S. Ito, and H. Rao. On substitution invariant Sturmian words: an 
application of Rauzy fractals. Theor. Inform. AppL, 41(3):329-349, 2007. 

[8] V. Berthe, A. de Luca, and C. Reutenauer. On an involution of Christoffel words and 
Sturmian morphisms. European J. Combin., 29:535-553, 2008. 

[9] M. Bucci, A. De Luca, and L. Q. Zamboni. Reversible Christoffel factorizations. In 
First Russian-Finnish Symposium on Discrete Mathematics (RuFiDiM), pages 13-15, St. 
Petersburg, Russia, 2011. Euler International Mathematical Institute. 

[10] M. Bucci, S. Puzynina, and L. Q. Zamboni. Central sets defined by words of low factor 
complexity. arXiv:1110.422 5, 2011. URL http : //arxiv . org/abs/ 1110.4225 , 

[11] A. de Luca. Sturmian words: structure, combinatorics, and their arithmetics. Theoret. 
Comput. Sci., 183:45-82, 1997. 

[12] J. Justin and L. Vuillon. Return words in Sturmian and episturmian words. Theor. Inform. 
AppL, 34(5):343-356, 2000. 

[13] M. Morse and G. A. Hedlund. Symbolic dynamics II. Sturmian trajectories. Amer. J. 
Math., 62:1-42, 1940. 

[14] N.B. Slater. The distribution of the integers n for which {9n} < (p. Proc. Cambridge 
Philos. Soc, 46:525-534, 1950. 

[15] V.T. Sos. On the distribution mod 1 of the sequence na. Ann. Univ. Sci. Budapest, Eotvos 
Sect. Math, 1:127-134, 1958. 

[16] L. Vuillon. A characterization of Sturmian words by return words. European J. Combin., 
22(2):263-275, 2001. 

[17] S. Yasutomi. On Sturmian sequences which are invariant under some substitutions. In 
Number theory and its applications (Kyoto, 1997), volume 2 of Dev. Math., pages 347-373. 
Kluwer Acad. Publ., Dordrecht, 1999. 



13 



